Reinforcement theory

Results: 290



#Item
21Policy Gradient Coagent Networks  Philip S. Thomas Department of Computer Science University of Massachusetts Amherst Amherst, MA 01002

Policy Gradient Coagent Networks Philip S. Thomas Department of Computer Science University of Massachusetts Amherst Amherst, MA 01002

Add to Reading List

Source URL: psthomas.com

Language: English - Date: 2013-11-16 15:49:43
22Data-Efficient Off-Policy Policy Evaluation for Reinforcement Learning  Philip S. Thomas Dhruva Tirumala Emma Brunskill Carnegie Mellon University

Data-Efficient Off-Policy Policy Evaluation for Reinforcement Learning Philip S. Thomas Dhruva Tirumala Emma Brunskill Carnegie Mellon University

Add to Reading List

Source URL: psthomas.com

Language: English - Date: 2016-06-06 11:57:32
23Journal of Economic Theory – 198 www.elsevier.com/locate/jet Self-tuning experience weighted attraction learning in games夡 Teck H. Hoa,∗ , Colin F. Camererb , Juin-Kuan Chongc, d

Journal of Economic Theory – 198 www.elsevier.com/locate/jet Self-tuning experience weighted attraction learning in games夡 Teck H. Hoa,∗ , Colin F. Camererb , Juin-Kuan Chongc, d

Add to Reading List

Source URL: people.hss.caltech.edu

Language: English - Date: 2007-09-04 15:38:46
24Detachment: Neither Kind, Nor Unkind There are valuable principles we can learn from the Al-Anon program. Al-Anon teaches men and women to live with alcoholics, both practicing and sober. These people have learned to liv

Detachment: Neither Kind, Nor Unkind There are valuable principles we can learn from the Al-Anon program. Al-Anon teaches men and women to live with alcoholics, both practicing and sober. These people have learned to liv

Add to Reading List

Source URL: respectmerules.com

Language: English - Date: 2016-04-08 23:06:03
25Language Change and the Force of Innovation Roland M¨ uhlenbernd1 and Jonas David Nick2 1  2

Language Change and the Force of Innovation Roland M¨ uhlenbernd1 and Jonas David Nick2 1 2

Add to Reading List

Source URL: www.sfs.uni-tuebingen.de

Language: English - Date: 2014-05-09 15:45:31
26Microsoft Word - Final Paper v3.docx

Microsoft Word - Final Paper v3.docx

Add to Reading List

Source URL: psthomas.com

Language: English - Date: 2013-11-16 15:52:50
27Incentive Compatibility of Bitcoin Mining Pool Reward Functions Okke Schrijvers, Joseph Bonneau, Dan Boneh, and Tim Roughgarden Stanford University  Abstract. In this paper we introduce a game-theoretic model for reward

Incentive Compatibility of Bitcoin Mining Pool Reward Functions Okke Schrijvers, Joseph Bonneau, Dan Boneh, and Tim Roughgarden Stanford University Abstract. In this paper we introduce a game-theoretic model for reward

Add to Reading List

Source URL: theory.stanford.edu

Language: English - Date: 2016-02-14 00:07:23
28CS261: A Second Course in Algorithms Lecture #11: Online Learning and the Multiplicative Weights Algorithm∗ Tim Roughgarden† February 9, 2016

CS261: A Second Course in Algorithms Lecture #11: Online Learning and the Multiplicative Weights Algorithm∗ Tim Roughgarden† February 9, 2016

Add to Reading List

Source URL: theory.stanford.edu

Language: English - Date: 2016-02-16 16:06:28
29Microsoft Word - hierarchical nash q learning in continuous games.docx

Microsoft Word - hierarchical nash q learning in continuous games.docx

Add to Reading List

Source URL: www.csse.uwa.edu.au

Language: English - Date: 2009-02-05 01:17:43
30J. Fluid Mech), vol. 789, pp. 726–749. doi:jfmc Cambridge University Press 2016

J. Fluid Mech), vol. 789, pp. 726–749. doi:jfmc Cambridge University Press 2016

Add to Reading List

Source URL: www.cse-lab.ethz.ch

Language: English - Date: 2016-02-10 05:23:25